CONVERGENCE OF GOAL-ORIENTED ADAPTIVE FINITE ELEMENT 
METHODS FOR NONSYMMETRIC PROBLEMS 



Abstract. In this article we develop convergence theory for a class of goal-oriented 
adaptive finite element algorithms for second order nonsymmetric linear elliptic equa- 
tions. In particular, we establish contraction and quasi-optimality results for a method 
of this type for second order Dirichlet problems involving the elliptic operator Cu = 
V • (AVu) — b ■ Vu — cu, with A Lipschitz, almost-everywhere symmetric positive def- 
inite (SPD), with b divergence-free, and with c > 0. We first describe the problem class 
and review some standard facts concerning conforming finite element discretization and 
error-estimate-driven adaptive finite element methods (AFEM). We then describe a goal- 
oriented variation of standard AFEM (GO AFEM). Following the recent work of Mom- 
mer and Stevenson for symmetric problems, we establish contraction of GOAFEM. We 
also then show convergence in the sense of the goal function. Our analysis approach is 
signficantly different from that of Mommer and Stevenson, combining the recent con- 
traction frameworks developed by Gascon et. al, by Nochetto, Siebert, and Veeser, and 
by Hoist, Tsogtgerel, and Zhu. In the last part of the paper we perform a complexity 
analysis, and establish quasi-optimal cardinality of GOAFEM. We include an appendix 
discussion of the duality estimate as we use it here in an effort to make the paper more 
self-contained. 
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1. Introduction 

In this article we develop convergence theory for a class of goal-oriented adaptive 
finite element methods for second order nonsymmetric linear elliptic equations. In par- 
ticular, we report contraction and quasi-optimality results for a method of this type for 
the problem 

-V ■{AVu) + b-Vu + cu = f, inn, (1.1) 

u = 0, ondn, (1.2) 

with ^7 C M°' a polyhedral domain, d = 2 or 3, with A Lipschitz, almost-everywhere 
(a.e.) symmetric positive definite (SPD), with b divergence-free, and with c > 0. The 
standard weak formulation of this problem reads: Find u E -f^o (fi) such that 

a{u,v) = f{v), \/veH',{Q), (1.3) 

where 

a{u,v) = / AVu ■ Vv + b ■ Vuv + CUV dx, /(f) = / fvdx. (1.4) 
Jn Jn 

Our approach is to first describe the problem class in some detail, and review some 
standard facts concerning conforming finite element discretization and error-estimate- 
driven adaptive finite element methods (AFEM). We will then describe a goal-oriented 
variation of standard AFEM (GOAFEM). Following the recent work of Mommer and 
Stevenson [ fTO| for symmetric problems, we establish contraction of GOAFEM. We also 
show convergence in the sense of the goal function. Our analysis approach is signficantly 
different from that of Mommer and Stevenson [TO], combining the recent contraction 
frameworks of Gascon et. al [4J, of Nochetto, Siebert, and Veeser [llj, and of Hoist, 
Tsogtgerel, and Zhu [8|. We also give a complexity analysis, and establish quasi-optimal 
cardinality of GOAFEM. 

The goal-oriented problem concerns achieving a target quality in a given linear func- 
tional g: Hq^H) — > M of the weak solution u E Hq(^1) of the problem (11.31) . For ex- 
ample, g{u) = j^Xui^^ the average value of u over some domain a; C fi. By writing 
down the adjoint operator, a*(z, v) = a{v, z), we consider the adjoint or dual problem: 
find z E HQ{n) such that a*{z, v) = g{v), for all v E Hq^VI). It has been shown for the 
symmetric form (b = 0) of problem (ll.ll) - (ll.2l) with piecewise constant SPD diffusion 
cofficient A (and with c = 0), that by solving the primal and dual problems simulta- 
neously, one may converge to an approximation of g{u) faster than by approximating u 
then g{u), when forcing contraction in only the primal problem IfTOl . We will follow 
the same general approach in order to establish similar goal-oriented AFEM results for 
nonsymmetric problems. However, in order to handle nonsymmetry, we will follow the 
technical approach in [9]|4l[8l, and rely largely on establishing quasi-orthogonality. In 
particilar, contraction results are established in [9, 4J for (|1.1I) - (I1.2I) in the case that A 
is SPD, Lipschitz or piecewise Lipschitz, b is divergence-free, and c > 0. In [8J, quasi- 
orthogonality is used as the basis for establishing contraction of AFEM for two classes 
of nonlinear problems. As in these earlier efforts, relying on quasi-orthogonality will 
require that we assume that the initial mesh is sufficiently fine, and that the solution to 
the dual problem a*(w, v) = g{v), g E L2{^) is sufficiently smooth, e.g. in iJ^^ ((]). 

Following |[8l, the contraction argument developed in this paper will follow from first 
establishing three preliminary results for two successive AFEM approximations Ui and 
U2, and then applying the Dorfler marking strategy: 
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1) Quasi-orthogonality ( ^3.11 ): There exists A > 1 such that 

III II|2 ^ A III III 2 III III 2 

If'" — "^211 <A.||n — Mill — ||M2 — Mill . 

2) Error estimator as upper bound on error ( §3.21) : There exists Ci > such that 

l\u-Ukf < Cir]l{uk,Tk), k = l,2. 

3) Estimator reduction ( ^3.41 ): For Ai the marked set that takes refinement 7i — 72, 
for positive constants A < 1 and Ai and any 5 > 

Vliv2,r2) < (1 + 6){r]l{v,,ri) - \v!{vi,M} + (1 + 6-')A,r]^Jv2 - v,l\. 

The marking strategy used is the original Dorfler strategy; elements are marked for re- 
finement based on indicators alone. The marked set Ai must satisfy 

TeM 

In the goal-oriented method, a second marked set is chosen based on an error indicator 
for the dual problem associated with the given goal functional, and the union of the two 
marked sets is then used for refinement. 

A main advantage of the approach in is that it does not require an interior node 
property. This allows us to establish the necessary results for contraction without taking 
full refinements of the mesh at each iteration. This improvement follows from the use 
of the local perturbation estimate or local Lipschitz property rather than the estimator 
as lower bound on error. We use the standard lower bound estimate as found in (9) for 
optimality arguments in the second part of the paper concerning quasi-optimality of the 
method. 

There are three main notions of error used throughout this paper. The energy error 
fu — Ukl\, the quasi-error and the total-error. The energy error is defined by the symmet- 
ric part of the bilinear form that arises from the given differential operator in (11.31) . The 
quasi-error is the I2 sum of the energy-error and scaled error estimator 

Qk{uk, Tk) ■■= - Ukf + Wkf"^, 

and this is the quantity that is reduced at each iteration of the algorithm. In ^the quasi- 
error is shown to satisfy 

\\u - Uk+if + 777^+1 < {\\u - Ukf + 7^fc) , a < 1- 
The total error includes the oscillation term rather than the estimator 

Ek{uk, Tk) ■■= - Ukf + osc^)^/l 

The oscillation term captures the higher-frequency oscillations in the residual missed by 
the averaging of the finite element method. While the quasi-error is the focus of the 
contraction arguments, it is the total error that will be critical to complexity analysis. 
Therefore, we will need to establish various preliminary results for both types of error. 

The quasi-optimality of the goal oriented method in ^g] is developed with respect to 
the total error which is shown to satisfy Cea's lemma. The cardinality result 

#r, - #ro < s{d) { M, M + ^ j Qi''\uk, Tk) 
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bounds the growth of the adaptive mesh with respect to the quasi-error of both problems. 
An equivalence between the quasi-error and total error is established in ^ 

A final brief comment is in order concerning the notation used here compared to that 
in [4] and the related literature. In [4J, the number of times each marked element is 
refined is denoted b. In this article, each marked element is refined once. Therefore, b 
will be reserved for the convection term in the nonsymmetric problem. The constant C 
will denote a generic but global constant that may depend on the data and the condition 
of the initial mesh To, and may change from step to step. 

Outline of the paper. The remainder of the paper is structured as follows. In ^ we 
first describe the problem class and review some standard facts concerning conforming 
finite element discretization and error-estimate-driven adaptive finite element methods 
(AFEM). In ^2.3[ we then describe a goal-oriented variation of the standard approach to 
AFEM (GOAFEM). Following the recent work of Mommer and Stevenson for symmet- 
ric problems, in ^|3]we establish contraction of goal-oriented AFEM. We also then show 
convergence in §3.61 in the sense of the goal function. Our analysis approach is signfi- 
cantly different, combining the recent contraction frameworks developed by Gascon et. 
al ^45, Nochetto, Siebert, and Veeser [llj, and by Hoist, Tsogtgerel, and Zhu [8]. In ^ 
we consider complexity questions, and establish quasi-optimal cardinality of GOAFEM. 
We recap the results in ^|5l and point out some remaining open problems. 

2. Problem class, discretization, goal-oriented AFEM 

2.1. Problem class, weak formulation, spaces and norms. Gonsider the nonsymmet- 
ric problem (11.31) . where as in (11.41 ) we have 

a{u, v) = {AVu, Vf ) + {b ■ Vm, v) + {cu, v). 

Here we have introduced the notation (■, ■) for the L2 inner-product over C M"'. The 
adjoint or dual problem is: Find z E Hq(^1) such that 

a*{z,v) = g{v) foraWv e H^{n) (2.1) 

where a*( ■ , ■ ) is the formal adjoint of a( ■ , ■ ), and where the functional is defined 
through 

g{u) = gu dx, (2.2) 
Jn 

for some given g E -L2(^). We will make the following assumptions on the data: 

Assumption 2.1 (Problem data). The problem data D = [A, b, c, /) and dual problem 

data D* = {A, —b, c, g) satisfy 

1 ) A : Q ^ M.'^^'^, Lipschitz, and a.e. symmetric positive-definite: 

ess inf ^^^\,nin{A{x)) = /ig > 0, (2.3) 

ess sup \ma.{A{x)) = /ii < 00. (2.4) 

2) b : fl ^ W^, with bk G -Z^oo(^) > (^f^d b divergence-free. 

3) c : fl ^W, with c e Loo{fl), and c{x) > Ofor all x E fl. 

4) f,gEL2{Q). 

The native norm is the Sobolev norm given by 

Wv^H^ = {Vv,Vv) + {v,v). (2.5) 
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The Lp norm of a vector valued function v over domain to is defined here as the I2 norm 
of the Lp{uj) norm of each component 




(2.6) 



Similarly, the Lp norm of a matrix valued function M over domain uj is defined as the 
Frobenius norm of the Lpioj) norm of each component 




(2.7) 



We note that one could employ other equivalent discrete lp norms in the definitions (|2.6I) 
and (12.71) . however this choice simplifies the analysis. 

Continuity of a( ■ , ■ ) follows from the Holder inequality, and bounding the L2 norm 
of the function and its gradient by the norm 

a{u,v) < {fii + \\b\\L^ + \\c\\l^) \\u\\hi\\v\\hi = Mc\\u\\hi\\v\\hi- (2.8) 

Coercivity follows from the Poincare inequality with constant Cq and the divergence-free 
condition 

a{v,v) > fio\v\]ji > CnfJ^oWvWni = ml\\v\\]ji, (2.9) 

where the coercivity constant m| := Cq^o- Continuity and coercivity imply existence 
and uniqueness of the solution by the Lax-Milgram Theorem [|7J. The adjoint operator 
a* ( , ) is given by 

a*{v,u) := a{u,v), u,v E Hq{^1). 
Integration by parts on the convection term and the divergence-free condition imply 

a*{z, v) := {AVz, Vv) - {b ■ Vz, v) + {cz, v). (2.10) 

Define the energy semi-norm by 

\\vf := a{v,v). (2.11) 
Non-negativity follows directly from the coercivity estimate (12.91 ) 

|||t;f > m|||t;||^i, (2.12) 

which establishes the energy semi-norm as a norm. Putting this together with the reverse 
inequality 

i^^lll' < ^^l\^v\l^ + ||c||i^||t;||i^ =^ \\v\\ < MeMn^, (2.13) 

establishes the equivalence between the native and energy norms with the constant = 

(1-^1 + Ml J'/'. 
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2.2. Finite element approximation. We employ a standard conforming piecewise poly- 
nomial finite element approximation below. 

Assumption 2.2 (Finite element mesh). We make the following assumptions on the un- 
derlying simplex mesh: 

1 ) The initial mesh % is conforming. 

2) The mesh is refined by newest vertex bisection ifTOl at each iteration. 

3) The initial mesh % is sufficiently fine. In particular, it satisfies Ii3.6\) . 

Based on assumptions 12.21 we have the following mesh constants. 

1) Define 

hr:=maxhT, where /it = |T|^/'^. (2.14) 

In particular, is the initial mesh diameter. 

2) Define the mesh constant 'Jn = '^Ir where 

7r = ° and /^mm = Klin 
then for any two elements T, T in the same generation 

and as neighboring elements may differ by at most one generation for any two 
neighboring elements T and T' 

liT < 27r/iT' = iNhr' (2.15) 

3) The minimal angle condition satisfied by newest vertex bisection implies the mesh- 
size hx is comparable to h^, the size of any true-hyperface a of T. In particular, 
there is a constant 7 

^<72forallT. (2.16) 
Let T the set of conforming meshes derived from the initial mesh To- Define T^r C T 

by 

T^ = {reT|#r-#ro<iv}. 

For a conforming mesh 7i with a conforming refinement T2 we say T2 > Ti. The set of 
refined elements is given by 

7^l^2 :=7^rl^r2 :=ri\(r2nri). (2.17) 

An overlay of two meshes Ti > % and T2 > % where T2 is not generally a refinement 
of 7i is given by 

Ti © 7^ := {T G ri\T C r for some T' e 7^} U {T G 7^|r C T' for some T' G Ti} 

(2.18) 

and is itself conforming. Define the finite element space 

Vr := H^{n) n H P„(T) and := Vr,. (2.19) 

TeT 

For subsets to CT, 

Yrioo) := H^{n) n J] P„(T), (2.20) 
Teui 
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where fniT) is the space of polynomials degree degree n over T. Denote the patch about 

TeT 

oot ■■= T U {T' e T \ T nV is Si true-hyperface of T}. (2.21) 

For a d-simplex T, an true-hyperface is a d — 1 dimensional face of T, e.g., a face in 3D 
or an edge in 2D. Define the discrete primal problem: Find Uk E Yk such that 

aiuk,Vk) = fivk), Vk e Vfc, (2.22) 

and the discrete dual problem 

a*{zk,Vk) = g{vk), Vk e V^. (2.23) 

2.3. Goal oriented AFEM (GOAFEM). As in [HOl the goal oriented adaptive finite 
element method (GOAFEM) is based on the standard AFEM algorithm: 

SOLVE ESTIMATE MARK REFINE . 

In the goal oriented method, one enforces contraction of the quasi-error in both the primal 
problem and an associated dual problem. As shown in section §3.6[ the error in the goal- 
function satisfies the bound 

\g{u) - g{uk)\ = \a{u - Uk, z - Zk)\ < 2|||u - Uklfz - Zkj. 

This motivates driving down the energy-error in both the primal and dual problems at 
each iteration. As noted in Q the residual-based error estimator does not exhibit mono- 
tone behavior in general, although it is monotone non-increasing with respect to nested 
mesh refinement when applied to the same (coarse) function. The quasi-error is shown 
to contract for each problem for which mesh refinement satisfies the Dorfler property. 
However, refining the mesh with respect to the primal problem does not guarantee the 
quasi-error in the dual problem will be non-increasing, and vice-versa. As such, the 
procedures SOLVE and ESTIMATE are performed for each of the primal and dual prob- 
lems. The marked set is taken to be the union of marked sets from the primal and dual 
problems, each chosen to satisfy the Dorfler property. This method produces a sequence 
of refinements for which both the error in the primal and dual problems contract at each 
step. 

Procedure SOLVE. The contraction result supposes the exact Galerkin solution is 
found on each mesh refinement. In practice a linear-time iterative method is employed 
so that the Galerkin solution is found up to a given tolerance. 

Procedure ESTIMATE. The estimation of the error on each element is determined 
by a standard residual-based estimator. The residuals over element interiors and jump- 
residuals over the boundaries are based on the local strong forms of the elliptic operator 
and its adjoint as follows. 

C{v) = V ■ (AVv) -b-Vv - cv; C*{v) = V ■ (AVv) + b ■ Vv - cv. (2.24) 

The residuals for the primal and dual problems using the sign convention in [4J are: 

R{v) := f + Civ); R*{v) := g + C*{v), v eYr- (2.25) 

While the primal and dual solutions u and z of (11.31 ) and (12.11 ) respectively satisfy 

f{z) = a{u, z) = a*{z, u) = g{u) 

the residuals for the primal and dual problems are in general different. The jump residual 
for the primal and dual problems is 

Jt{v) := l[AVv]-nj9T (2.26) 
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where jump operator | ■ ] is given by 

IMqj, ■- lim (l)(x + tn) - (pix - tn) (2.27) 

i— 5>0 

and n is taken to be the appropriate outward normal defined piecewise on dT. On bound- 
ary edges ah we have 

so that |[y4Vf ] ■ n\QT = {[AVv] ■ njarnn- For clarity, we will also employ the notation 

Rt{v) := Riv)\^, veYr, 
and similarly for the other strong form operators. The error indicator is given as 

v'Tiv,T) := h^R{v)\\l^^^^ + hf\\Mv)\\l^^,^^, V G V^. (2.28) 
The dual error-indicator is then given by 

Criw^T) := h^^\\R*{w)\\l^^^^ + hf\\Mw)\\l^^,^^, w G Vr- (2.29) 

The error estimators are given by the Ip sum of error indicators over elements in the space 
where p = 1 or 2. 

vUv):=J2vUv,T), veYr. (2.30) 
tgT 

The dual energy estimator is: 

C^H:=5^CfH, weYr. (2.31) 

TeT 

The contraction results for the quasi-error presented below will be shown to hold for 
p = 1,2 where the error estimator and oscillation are defined in terms of the Ip norm. 
While complexity results are shown only for p = 2, the contraction results for p = 1 are 
useful for nonlinear problems; see 

For analyzing oscillation, for v G V7- let 11^ the orthogonal projector defined by the 
best L2 approximation in over mesh T and = I — H^. Define now the oscillation 
on the elements T G T for the primal problem by 

oscr(^;,T) := hT\\Pl_2Ri^)\\L,iT) (2.32) 
and analogously for the dual problem. For subsets w C 7" set 

osc^(t;, tu) := ^ osc?^{v, T). (2.33) 

Tew 

The data estimator and data oscillation, identical for both the primal and dual problems, 
are given by 

r/^(D,T) := h^^ (||divAr,^(^) + h-'\\A\\l^,^^^ + \\c\\l^^^^ + \\b\\l^^^^) , (2.34) 
osc^^(D,T) := h^^ (llPr-idivAr^^(^) + Vll^r^llL(T) 

+ ^tII-P)?^2c||l^(T) + 11-^2^-2^11^^(2-) + \\P^im^{T)) ■ (2.35) 

The data estimator and oscillation over the mesh T or a subset C T are given by the 
maximum data estimator (oscillation) over elements in the mesh or subset: For to CT 

r]riD,uj) = ma,xr]r{D,T) and osc7-(D, w) = maxoscr(-D, 7")- 
The data estimator and data oscillation on the initial mesh 

?7o := riTo{D,%), and oscq := oscro{D,To). 
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As the grid is refined, the data estimator and data oscillation terms satisfy the monotonic- 
ity property JH for refinements T2 > Ti 

V2{D, T2) < m{D, Ti) and osc2(Z^, T2) < osci(D, Ti). (2.36) 

Procedure MARK. The Dorfler marking strategy for the goal-oriented problem is 
based on the following steps as in ifTOl : 

1) Given 9 E (0, 1), mark sets for each of the primal and dual problems: 

• Mark a set Mp C 71- such that, 

vliukX)>d'vliuk,%) (2.37) 

TeMp 

• Mark a set A^^^ C Ta; such that, 

J2 CU^k,T)>9'Clizk,%) (2.38) 

T€Md 

2) Let A4 = Aip U Aid the union of sets found for the primal and dual problems 
respectively. 

The set Ai differs from that in IfTOl , where the set of lesser cardinality between 
Aip and A^^ is used. In the case of the nonsymmetric problem the error reduced at each 
iteration is the quasi-error rather than the energy error as in the symmetric problem [UOl . 
This error for each problem is guaranteed to contract based on the refinement satisfy- 
ing the Dorfler property. As such, refining the mesh with respect to one problem does 
not guarantee the quasi-error in the other problem is nonincreasing. Sets Aip and Aid 
with optimal cardinality (up to a factor of 2) can be chosen in linear time by binning the 
elements rather than performing a full sort IfTOll . 

Procedure REFINE. The refinement (including the completion) is performed accord- 
ing to newest vertex bisection [2] . The complexity and other properties of this procedure 
are now well-understood, and will simply be exploited here. 

3. Contraction and convergence theorems 

The key elements of the main contraction argument constructed below are quasi- 
orthogonality l3.1[ error estimator as upper-bound on energy-norm error 13. 21 and estimator 
reduction 13.41 Estimator-reduction is shown via the local-perturbation estimate [33l The 
local perturbation of the oscillation is presented here and used in ^|4l Mesh refinements 
7i and T2 (respectively TJ) are assumed conforming, and Uj is assumed the Galerkin so- 
lution on refinement . The following results hold for both the primal and dual problems 
which differ by the sign of the convection term; therefore, they are established here only 
for the primal problem. 

3.1. Quasi-orthogonality. Orthogonality in the energy-norm — U2IP = — Mi|p— 

l\u2 — Uilp does not generally hold in the nonsymmetric problem. We use the weaker 
quasi-orthogonality result to establish contraction of AFEM (GOAFEM). The following 
is a variation on Lemma 2.1 in |[9l (see also |[8l). 

Lemma 3.1 (Quasi-orthogonality). Let the problem data satisfy Assumption 12. 1 1 and the 
mesh satisfy conditions (1) and (2) of Assumption 12. 21 Let 7i, 72 G T with > Ti- Let 
Uk G Vfc the solution to (12.221 ), k = 1,2. There exists a constant > depending on the 
problem data D and initial mesh To, and a number < s < 1 dictated only by the angles 
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— —1/2 

of dQ, such that if the meshsize ho of the initial mesh satisfies A := C^hQ\\b\\i^^Q < 1, 
then 

fu — 'U2IIP < AIIm — MilP — |||'U2 — "^llVy (3-1) 

where 

A:= {l-C.ht,\\b\\L^f^o'^')-'- 
Equality holds (usual orthogonality) when b = in Q, in which case the problem is 
symmetric. 

Proof. The proof follows close that of Lemma 2.1 in [9]. Let 

62 '■= u — U2, ei := u — Ml, and ei := U2 — ui. 
By Galerkin orthogonality 

Ifeilf = a(ei, ei) = |||e2|||^ + + a{ei, 62). (3.2) 
Rearranging and applying the divergence-free condition on the convection term 

Il|e2|||' = lieif -lleif -2(6-V£i,e2). 

Applying Holder's inequality and coercivity (12.91 ) \ei\H^ < A^o followed by Young's 

inequality with constant S to be determined, 

-2(6 ■ Vei, 62) < 6\\e2\\l^ + (3.3) 

By a duality argument for some > assuming u E for some < s < 1 

depending on the angles of dVl 

||e2||L2 < C*/ig|||e2|||. (3.4) 

The details of this argument as described in the appendix ^ may also be found in [[T]| 
and ISJ. Applying and (D to (lO) . 

(1 - 6c'X')h - U2r <i\u-u,r-(i- ip-^ i\m - U2r. (3.5) 



Choose S to equate coefficients 



then 

Assuming the initial mesh as characterized by Hq satisfies 

A=||6||L^a/^^^o'/'<l, (3.6) 

the quasi-orthogonality result holds. □ 
Note that by (|3.2I) we also have 

llki|ir = ll|ei|||'-|||e2|||'-2(6-Ve2,£i). (3.7) 

Similarly to ^3 

-2(6- Ve2,£i) > -2|(6- Ve2,£i)| > -^IkillL - ip^iesir, (3.8) 

O/iO 
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which under the same assumptions yields the estimate 

|||m2 - Mllir > (1 + - Mlf - - ^l2|||^ (3.9) 

whereA<l =^ (l + A)-i>l/2. 

3.2. Error estimator as global upper- bound. We now recall the property that the error 
estimator is a global upper bound on the error. The proof is fairly standard; see e.g. [fTOll 
(Proposition 4.1), HH (3.6), and M- 

Lemma 3.2 (Error estimator as global upper-bound). Let the problem data satisfy As- 
sumption 12. 1 1 and the mesh satisfy conditions (1) and (2) of Assumption l272l Let 7i, 72 G 
T with 72 > 7i. Let Uk G the solution to (|2.22l) . A; = 1, 2 and u the solution to (|1.3I) . 
Let 

G = G{r2,ri) := {TcTi \ Tnf y^il} for some f eTuf ^Ti}. 
Then for global constant Ci depending on the problem data D and initial mesh % 

l\u2-ui\l<Cir]i{ui,G) (3.10) 

and in particular 

l\u- uil < Ci'i]i{ui,Ti). (3.11) 

3.3. Local perturbation. The local perturbation property established in flU, analogous 
to the local Lipshitz property in [8], is a key step in establishing the contraction result. 
This is a minor variation on Proposition 3.3 in [4] which deals with a symmetric prob- 
lem. Here, we include a convection term in the estimate. In particular, (13.121) shows 
that the difference in the error indicators over an element T between two functions in 
a given finite element space may be bounded by a fixed factor of the native norm over 
the patch lot of the difference in functions. In contrast with the analogous result in 
the estimate (13.131) involves a fixed factor of the native norm over an individual element 
rather than a patch as by the continuity of A the oscillation term does not involve the 
jump residual. 

We include the proof of (13.121) for completeness. The proof of (|3.13l) may be found 
in [HH with the final result inferred by the absence of the jump residual in the oscillation 
term. 

Lemma 3.3 (Local perturbation). Let the problem data satisfy Assumption 12. i I and the 
mesh satisfy condition (1) of Assumption \2.2\ Let T G T. For all T E T and for any 

v,w E Yj- 

Vt{v, T) < r]r{w, T) + Air/r(/^, T) \\v - w\\hh^^^) (3.12) 
oscr(t^,r) <o%cr{w,T) +k20^CT{D,T)\\v -w\\m(T) (3.13) 

where recalling (12.211 ) ut is the union ofT with elements in T sharing a true-hyperface 
with T. The constants Ai, A2 > depend on the initial mesh %, the dimension d and the 
polynomial degree n. 

Proof of From (1228]) 

r^'r{v,T) := Kmv)\\l,^r) + hfWMvW,^^,^^, v G V^. (3.14) 
Denote rj-fiv, T) by ri{v, T). Set e = v — w.^y linearity 

R{v) = R{w + e) = f + C{w + e)=f + C{w) + £(e) = R{w) + C{e) 

and 



J{v) = J{w + e) = J{w) + J(e). 
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For p = 1 by the triangle inequality 

il{v,T) = hrWRiw) + /:(e)|U,(T) + hll^J{w) + J(e)|U,(aT) 

< ri{w,T) + hT\\C{e)\\L^(^T) + /^r^||^(e)||L2(aT)- 
For p = 2 using the generalized triangle-inequality 

^{a + 6)2 + (c + d)2 < + c2 + 6 + rf, fora,6,c,d>0 (3.15) 

we have 

r/(t;,T) = (4||i?(^) + £(e)||i^(^) + /.^H J(i.) + J(e) 

< r^{w,T) + /^r||£(e)|U,(T) + h}l'\\J{e)\\L,i9T). 

Consider the second term on the RHS /ir||£(e)||L2(T)- By definition (|2.24l) of £( ■ ), the 
product rule applied to the diffusion term and the triangle-inequality 

||>C(e)||L2(r) < \\d\vA ■ Ve||L2(r) + \\A : D'^e\\L2{T) + \\ce\\L2{T) + \\h ■ VcHl^ct) 

where D'^e is the Hessian of e. Consider each term. The first diffusion term 

||divA ■ Ve||L2(T) < ||divA||L^(r)||Ve||L2(r) (3.16) 

by the inequality 

11^^ ■ z\\l2(t) < \Mloo{t)\\z\\l2(t), V e Loo(T), z e L2{T). (3.17) 
Applying (13.171) and inverse-estimate to the second diffusion term 

\\A : D^e\\L2{T) < PIIloo(t) P^e||L,(T) 

<C,/iylA|U^(r)||Ve|U,(T). (3.18) 

For the reaction term 

\\ce\\L2(T) < l|c||Loo(T)||e||L2(T)- (3.19) 

For the convection term applying (|3.17l) 

VelU^m < ||&||L^(T)||Ve|U2(T). (3.20) 

Consider the the jump-residual term || J(e) \\L2{dT)- For each interior true-hyperface a = 
TnV, T,T' e Tby dm 

J(e) I := hm (AVe) (x + tn^) - lim (AVe) (x - tn^) 

= ■ (AVe) 1^ - ■ (AVe) (3.21) 

where (AVe) |^ is understood to refer to the product of the limiting value of AVe as the 
element boundary is approached from the interior of T. By the triangle-inequality 

\\J{e)\\L2{a) < ha ■ (AVe)|^||L2(a) + " {AV e)\^,\\ L^ia) ■ 

By bounds for the inner-product with a unit normal and a matrix- vector product 

U-n\\L2ia)<ML2{.), (p&L^ia), (3.22) 
||M0|U,(T) < ||M|U^(r)||0|U,(T), M G Loo(T), <f) e L2(T) (3.23) 

obtain 

\K ■ (^Ve)|^|U,(,) < ||(AVe)|^|U,(,) < ||A|^|U^(,) || Ve|^|U,(,). (3.24) 
Applying the trace theorem and an inverse inequality to || Ve|^||L2(cr) via the inequality 

L2(a) < Ch^'^'mL2iT), e L2(T) (3.25) 
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we have 

||Ve|^lU,(.) < CT(7)"-'/^^'^'l|Ve|U,(T). (3.26) 
By the Lipschitz property of A 

PItIU^W = PIUooM < PlU^(r). (3.27) 
By (13.241) . (13.261) . (13.271) and comparability of mesh diameters (12.151) 

II J(e)|U.M < 2CT(7)'-Si/'/^T'^'PIUc.(c..)l|Ve|U,(.,). 
Element T has at most d + 1 interior true-hyperfaces yielding 

||J(e)|U.(aT) < 2(rf+ 1) CT(7)''Sif'/^^'/'P|U^(^,)||Ve|U,(^,) 
= Cjhj, ||A||i^((^j,)|| Ve||L2(^j,). 
Putting together the terms from C and from the jump residual, 
V{v^ T) < 7]{w, T) + hr (||divA|U^(r) + Cih^^ P|U^(t) 

+ I|c||loc(t) + II^IUocH) l|e||Hi(T) + hj. Cjhj, ||v4||l^(;^j,) ||e||Hi(a;T) 
< ri{w,T) + CTOT'riT{D,T)\\v - wWm^ur) 
where Ctot' differs by a factor of 2 for p = 1, 2. □ 

3.4. Estimator reduction. We now establish one of the three key results we need, 
namely estimator reduction. This result is a minor variation of H| Corollary 2.4 and 
is stated here for completeness. 

Theorem 3.4 (Estimator reduction). Let the problem data satisfy Assumption 12. i I and 
the mesh satisfy conditions (1) and (2) of Assumption \272\ Let 7i G T, C 7i and 
Ti = REFINE{Ti, M). Forp=l let 

Ai := {d + 2fK\mf and A := (1 - 2-^/2^)2 > g 

and for p = 2 let 

Ai := {d + 2)Almf and A := 1 - 2"^/'^ > 
with Ai from \33\ ( Local Perturbation). Then for any vi G Vi and V2 G V2 and 5 > 

r]l{v2.T2) <(1 + 5) [rfM^Ti) - Xri{vi,M)] + {I + 5~^)Kiri\\v2 - v^f . (3.28) 

Proof. The proofs for p = 1 and p = 2 are similar. For p = 1 it is necessary to sum over 
elements before squaring and for p = 2 square first then sum over elements. 
Proof for the case p = 1. By the local Lipschitz property (13.121 ) 

yi2{v2,T) < r]2{vi,T) + Kir]2{D,T)\\v2 - Vi\\m^^^). (3.29) 

Summing over all elements T E T2, the sum of norms over cot covers each element at 
most {d + 2) times as each patch ut is the union of element T and the (up to) d + 1 
elements sharing a true-hyperface with T. Then by the coercivity (12.121 ) over 

V2{V2,T2) <m{vur2) + {d + 2)Aime-^7]l{D,T2)l\v2-vil (3.30) 

Squaring (13.301 ) and applying Young's inequality with constant S to the cross-term, 

Vl{v2,r2) < {l + 6)r,l{v^,T2) + {l + 6^'){d + 2fK\mfr,l{D,T2)\\v2-v^f 

= (1 + 6)r,l{v,, T2) + (1 + 6-^)A,r,l{D, T2)\\v2 - v,f. (3.31) 
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For an element T e M marked for refinement, let T2,t ■= {T' E T2 \ T' C T}. As 
f 1 G Vi has no discontinuities across element boundaries in T2,t, we have J(t'i) = on 
true hyperfaces in the interior of 72,r- 

Recall the element diameter hx = \T\^^'^. For an element T marked for refinement, T' 
must be a proper subset of T, in particular a product of at least one bisection so that 

IT'I < -ITI ^ iTy/'^ < 47:T|r|^/'^ ^ hr' < -^hr- (3.32) 

Then 

^ ri2{yiX)< hT'\\R{Vi)\\L2{T') + h}l?\\J{v)\\L2(&T'ndT) 

T'eT2,T T'eT2,T T'£T2,T 

L2(dT) 

T'€T2.T 

< 2~'/'' {hT\\R{v,)h2iT) + h]('\\J{v)h,^eT)] 
= 2-1/2 (333) 
For an element T ^ Ai, that is T' = T the indicator is reproduced 

r]2ivuT')=r],{vuT). (3.34) 

Sum over all T G Ti by estimates (13.331) . (13.341 ) writing the sum of indicators over the 
7i \ as the total estimator less the indicators over the refinement set Ai. Let the 
refined set 7^ := {T G 7^ | T' C T for some f e M} then 

V2{vi,T2) = V2{vi,T) 

V2{vi,T) 

TeTz\n Ten 

= Vi{vi,ri)-XiVi{vuM) (3.35) 
where Ai = 1 - 2-i/2d ^ ^ Squaring (l335l) 

rilivu T2) < r,l{vi, Ti) + \\ r,l{vi,M) - 2A? r,l{v^, M) 

= Tjlivi.Ti) - XriliviM) (3.36) 

where A = = (1 - 2~'^^'^'^Y. Applying (13.361) to (13.311) and applying monotonicity of 
the data-estimator 

ril{v2,T2) < (1 + 5) {ril{v,,T{) - \r,l{v,M)) 
+ {l + 6-')Klr,l{D,%)\\v2-v,f. 

The proof for the case p = 2 is similar and may be found in [HI. □ 

3.5. Contraction of AFEM. We now establish the main contraction results. The con- 
traction result [33] is a modification of [4] Theorem 4. 1 . Here we use quasi-orthogonality 
to establish contraction of each of the nonsymmetric problems (|1.3I) and (|2.1I) . 

Theorem 3.5 (GOAFEM contraction). Let the problem data satisfy Assumption 12. 1 1 and 
the mesh satisfy Assumption \2.2\ Let u the solution to (11.31 ). Let 9 G (0, 1], and let 
{Tk,^k, Uk}k>o be the sequence of meshes, finite element spaces and discrete solutions 
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produced by GOAFEM. Then there exist constants 7 > and < a < 1, depending on 
the initial mesh % and marking parameter 9 such that 

Iju - Uk+if + -fVk+i < (111^ - t^klf + IVk) ■ (3-37) 

The analogous result holds for the dual problem with {Tk,Yk, Zk}k>o the sequence of 
meshes, finite element spaces and discrete solutions produced by GOAFEM. 

Proof. Denote 

ek = u- Mfc, Cfc+i = u- Uk+i and Sk = Wfc+i - ^fc- 

Let 

Vk = Vkiuk, Tk), VkiMk) = Vkiuk, Mk) and 77^+1 = r]k+i{uk+i, Tk+i). 
By the result of Estimator Reduction 13 .41 for any 5 > 

riUi < (1 + 5) {r,l - WkiMk)] + (1 + 5-')K,ril\\ekf. 

Multiplying this inequality by positive constant 7 (to be determined) and adding the 
quasi-orthogonality estimate |||efc+i IIP < Mi CfclP — Ill^fclP obtain 

|||efc+if + 77/2^1 < Alle^f - |||£fcf + 7(1 + 5) [vl - WkiMk)] 

+ -l{l + 5-')kiril\\ekf. (3.38) 

Choose 7 to eliminate \sk\ the error between consecutive estimates by setting 

7(1 + ri)Air/2 = 1 ^ ^ = (,^^,^^ 2 ^ 7(1 + ^) = T^- (3-39) 

Applying (l339l) to (l338l) obtain 

|||efc+if + 7r?^+i < A|||efcf + 7(1 + 5)r,l - 7(1 + 5)\r,l{Mk). (3.40) 

By the Dorfler marking strategy 77^ (A^^) > Q'^Vk that 

|||efc+if + 7r/^+i < A|||efcf + 7(1 + 5)r,l - 7(1 + 5)\eW^,. (3.41) 

Split the last term by factors of (5 and (1 — (5) for any /3 G (0, 1) to arrive at 

|||efc+if + Wk+i < n^kf + 7(1 + 5)ril - /37(1 + 5)A^'r7^ 

-{l-P)^{l + 5)\eW^. (3.42) 

Applying the upper-bound estimate (13.1 II ) |||efc|p < Ciifl to the term multiplied by (3 
then by (l339l) 

2^ 2 . , 2 /37(1 + '^)A^^ 2 

|||efc+i||| +7%+i < A|||efc||| |||efc||| + 7(1 + 5)//^ 



A|||efc|||' - /377-r^lllefclir + 7(1 + 8)rii 

-{l-(5)^{l + 5)\eW^ (3.44) 

(a - P^^) ¥kf + 7(1 + 5) (1 - (1 - vl (3.45) 

al{6,mekf + ial{6,P)vl (3.46) 
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where 

:= A - := (1 + 5) (l - (1 - P)Xe') . (3.47) 

OiAi?7q 

Choose 5 small enough so that 

:= max{al,al} < 1. 
To ensure such a 5 exists in light of the quasi-orthogonality constant A > 1 observe 

a?<lwhen5>(A-l)^^ 

and 

al < 1 when ^ < (l - (1 - ' 1 = , [/J^^^l^ 

so to obtain an interval of positive measure where 5 may be found we require 

^ ' (5\e^ 1 - (1 - /3)A^2 
placing a second constraint on the quasi-orthogonality constant 

Ci\i% (1 - (1 - (i)X9^) 

where < /3 < 1 and 6 <1 may be chosen. In order to place bounds on the growth rate 
of the mesh, we further require 6 < 6^, given by (14.51) as discussed in section ^gl □ 

Notice the choice of 5 small enough to satisfy «^ < 1 is always possible, as each term 
may be independently driven below unity by a sufficiently small value of 5, so long as 
the quasi-orthogonality constant A is sufficiently close to one. For a discussion on the 
optimal contraction factor see Remark 4.3 in [4]; see also the discussion in [[8]|. 

3.6. Convergence of GOAFEM. We now derive a bound on error in the goal function. 

Theorem 3.6 (GOAFEM functional convergence). Let the problem data satisfy Assump- 
tion \2.1\ and the mesh satisfy Assumption \2.2\ Let u the solution to (11.31) and z the solution 
to (|2.1I) . Let 6 G (0, 1], and let {Tk, Vfc, Uk, Zk}k>o be the sequence of meshes, finite el- 
ement spaces and discrete primal and dual solutions produced by GOAFEM. Let 7p the 
constant from Theorem \3. 51 applied to the primal problem (|2.22l) and 7^ the constant 7 
from Theorem \3.5\ applied to the dual (12.231) . Then for constant a < 1 as determined by 
Theorem \3.5\ 

\g{u) - g{uk)\ < 2 {a^'' - uof + 7p%'K, 7^)) - IpVlV^^ 
X {a''{l\z-zor + ^,eoi^o,To)) -l.ClV^". 
Proof. On the primal side for all Vk G Yk 

a{u - Uk, Vk) = a{u, Vk) - a{uk, Vk) = f{vk) ~ f{vk) = 0, 
the primal Galerkin orthogonality property. On the dual side, g{u) = a*{z,u) and 
g{uk) = o.*{z, Uk, ) SO that 

g{u) - g{uk) = a*{z, u - Uk) 
= a{u - Uk, z) 

= a{u-Uk,z- Zk). (3.49) 
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Define an inner-product a by tlie symmetric part of a( ■ , ■ ) 

a(f , w) = {AVv, Vw) + {cv, w), 

then 

= a{v,v) = a{v,v), 

and 

a{v, w) = a{v, w) + {b ■ Vf , w). 

Then as «(■, ■) is a symmetric bilinear form on Hilbert space; it is an inner product and it 
induces a norm identical to the energy norm induced by a( ■ , ■ ). As such we may apply 
the Cauchy-Schwarz inequality [6J to a and we're left to handle the convection term. 

a{u -Uk,z - Zk) = a{u -Uk,z - Zk) + {h ■ V{u -Uk),z - z^) 

< l\u - Uklfz - Zkl + {b ■ V(m - Uk), z - Zk). (3.50) 

By Holder's inequality followed by a duality estimate as in ^|6]on the dual error and 
coercivity on the primal, 

{b-V{u-Uk),z- Zk) <\\b\\L^C^hQfiQ jz - Zklfu - Ukl\. (3.51) 

Recalling A = ||&||looC*^o/"o 

a{u — Uk, z — Zk) < \u — ?ifc||| III2; — Zk\ + A|||n — 'Ua,-||| |||-^ — Zk\. (3.52) 
Under assumption (13.61 ) (A < 1) on the initial mesh and from (13.491 ). 

\g{u) - g{uk)\ = \a{u - Uk, z - Zk)\ < 2|||n - Uklfz - Zkj. (3.53) 
From 13.51 there is an « < 1 such that for the primal problem with estimator r]k 

l\u - Uk+i\f < {l\u - Ukf + ipffk) - ipffk^i (3.54) 
and for the dual problem with estimator Ck 

\\z - Zk+if < (Ilk - Zkf + 7dCfc) - IdCl+i- (3.55) 
Iterating, we have from (13.541 ) and (13.551 ) 

\lu - Ukf + iprg < (|||u - Mof + 7p^o) (3-56) 
Ilk - Zkf + IdCl < (Ilk - ^of + 7dCo) • (3.57) 
From (13.531 ), (13.561 ) and (13.571) obtain the contraction of error in quantity of interest 

\g{u)- g{uk)\ <2{a'^{lu- u^f + 7p^?o(wo, %)) - 

X {a'>'{lz-z,f + ^,Coi^,,%))-7,Ciy^\ (3.58) 

or more simply 

\g{u)-g{uk) \ +%Vk + ^dCl < (Ilk - MqIP + 7p^o(«o, 7^) 

+ ||k-zo|||' + 7dCo(^o,ro)) (3.59) 

with Qq the quasi-error on the initial mesh. 



a'^Ql (3.60) 



□ 
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4. Quasi-optimal cardinality of GOAFEM 

In this section we establish the quasi-optimality of GOAFEM. The result in §4.51 fol- 
lows from bounding the cardinality of the marked set for each of the primal and dual 
problems at each iteration as shown in Lemma |4~9l This is achieved by assuming the 
primal and dual solutions belong to appropriate approximation classes as discussed in 
§4.4[ the optimality assumptions addressed in §4.2[ and the supporting results below. 
Under the optimality assumptions, the error-indicator as an upper-bound on energy-error 
as shown in §4. II and a bound for the oscillation term as the mesh is refined as shown in 
§4.21 a suitable reduction in global error between two consecutive iterations implies the 
respective refinement set satisfies the Dorfier property. We address the effect of quasi- 
orthogonality on the necessary reduction to achieve this result. 

The estimator as global lower bound on total error in ^4.11 is used to relate the total- 
error to the quasi-error in §4.5[ connecting the contraction property for the quasi-error 
established in ^to the quasi-optimality of the total error in §4.31 which shows the total 
error satisfies Cea's Lemma. 

4.1. Estimator as global lower bound and localized upper bound. We start with two 
fairly standard results that will be needed in the complexity analysis. The global lower 
bound may be found in [9] Lemma 3.1 and a similar result in [fTOl Proposition 4.3 and 
Corollary 4.4. The localized upper bound is established in Hi Lemma 3.6. 

Lemma 4.1 (Global lower bound). Let the problem data satisfy Assumption 12. 1 1 and the 

mesh satisfy Assumption \2.2\ Let 7i, 72 € T and T2 > Ti a full refinement. Let Uk G 
the solution to (|2.22l) . k = 1,2. Then there is a global constant C2 > such that 

C2v1{uu Ti) < - uif + osc?(mi, Ti). (4.1) 

Lemma 4.2 (Localized upper bound). Let the problem data satisfy Assumption I2.il and 
the mesh satisfy conditions (1) and (2) of Assumption \2.2\ Let 7i, 72 G T with 72 > 7i. 
Let IZ := IZr^^Ti of refined elements. Let Uk G Vfc the solution to (|2.22l) . k = 1,2. 

Then there is a global constant Ci with 

l\u2-ui\f <C,7]liu,,n). (4.2) 

4.2. Optimality assumptions and optimal marking. In this section we consider the 
assumptions on marking parameter 9 and the marking strategy which allow us to char- 
acterize the growth of the adaptive mesh at each iteration with respect to the total error 
in 1431 

We first consider oscillation on the refined mesh, following closely [4], Corollary 3.5. 

Lemma 4.3 (Oscillation on refined mesh). Let the problem data satisfy Assumption 12. 1 1 
and the mesh satisfy condition (1) of Assumption 12. 21 Let 7i, 72 G T with % > Ti. Let 
A2 = Agm^^ with A2 from (13.131) . Then for all Vi G Vi and V2 G V2 

o%ci{vi, Ti n 7^) < 2osc^(t;2, Ti n 7^) + 2A20sc^|||i;i - V2f, (4.3) 

where osCq := osc^^(Z^, %). 

Proof. For all elements T in the intersection T G 7i fl 72 

osci(i;i,T) = osc2(fi,T). 

Applying this, G Vi C V2 and osCj{D,T) < osco(-D,T), j = 1,2, we have 
from(l3J3l) 

0SC2(Wl,T) < OSC2(t;2,7') + A2OSC0II 
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Squaring and applying Young's inequality with e = 1 yields 

osc l{vi,T) < 2osc^(w2,T) + 2A2 

OSCg 1 1 f 1 - f 2 1 1 /fi (T) • (4-4) 

Summing over all T G 7i fl 72 and bounding the norm over 71 fl 72 to the entire domain 
to apply the coercivity estimate (12.91) 

osc^(wi, Ti n 7^) < 2osc^(?;2, Ti n 7^) + 2A20sc^|||t;i - Vif. 

□ 

We now discuss some basic assumptions for complexity analysis. The optimality as- 
sumptions follow those found in [4J with modifications in (14.51) to account for the non- 
symmetric problem, the continuity of A and the goal-oriented method. 

Assumption 4.4 (Optimality assumptions). Assume the following conditions. 

1 ) The marking parameter 6 satisfies 9 G (0, 6**) with 

6* = , , ^ I ? I OA 2T' ^^^^ os'^o = osc^(D, To) (4.5) 

1 + Ci(l + A + 2A2OSC0) 

and A given by (13.61 ). As the data oscillation given by (12.351) is identical for the 
primal and dual problems and the other constants depend only on global data, 9^^ 
may be assumed the same for both the primal an dual problems. 

2) A marked set M.k of optimal cardinality (up to a factor of two) is selected 
(see ifTOl). 

3) The distribution of refinement edges on % satsifies condition (b) of section 4 
in EH. 

We now consider a basic result on optimal marking. This lemma is a variation of 
Lemma 5.9 in [|4|, modified to use quasi-orthogonality 13. ll rather than Galerkin orthogo- 
nality. 

Lemma 4.5 (Optimal marking). Let the problem data satisfy Assumption 12.71 and the 

mesh satisfy Assumption \2.2\ Let 7i,72 G T. Let G the solution to (|2.22l) . 
/c = 1, 2. Let the marking parameter 6 satisfy condition (1) of Assumption \4.4\ 
Let 72 > 7i satisfy 

lu - M2IIP + oscn < - - MilP + osc?) (4.6) 

a 

which implies 

alu - M2IP + 0SC2 < /i {lu - uilp + osc^) (4.7) 

for := i(l — 1^) and a = (1 + A), A G (0, 1) given by (13.61 ) in the quasi-orthogonality 
argument and where 

osci = osci(mi, 71), 0SC2 = osc2(-U2, 7i), and Tji = rji{ui,Ti) ■ 

Then the set IZ := IZq-^^j-^ satisfies the Dorfier property 

r]i{ui,n) > 9r]i{ui,Ti). 

Proof. (See H Lemma 5.9). As < 2/i < 1, multiply inequality (1431) by 1 - 2/i to 
obtain 

(1 - 2y)c2rjl < lu - uilp + osc^ - 2/i - + osc^) . 
Applying (lO) 

(1 — 2/i)c2?7i ^ III'" " ^iIlP ~ otl"^ ~ ^2||P + OSCi — 20SC2. 
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Rearranging terms obtain 

osci — 20SC2 > (1 — 2ii)c2r]\ + Q;|||n — U2\^ — \u — Ui\^. (4.8) 
By the second quasi-orthogonality estimate (13.91) 

(1 + K)\\U - U2f - \\U - Mif > -(1 + A)|||mi - U2f 

where < A < 1. Applying (|43I) 

{I + K)\\u - U2f - \\u - uif > -{l + A)Cir]f{ui,n). (4.9) 
Combining (1491) with (l48l) obtain 

osc^ - 2osc^ > (1 - 2fi)c2vf - (1 + A)Cirif{ui, R). (4.10) 
For refined elements T E TZ use the dominance of the estimator over the oscillation 

oscliu^T) <7]l{uuT). 
For elements T G 7^ n 7^ ([43]) yields 

OSCi(mi, 7i n 72) - 20SC2('U2, 7l n 72) < 2A20SCo|||'Ui - ■U2|||^- 

Then 

oscl{ui,Ti) - 2osc2(m2,7^) < rjliui.TZ) + 2A20SCo|||mi - M2|||^- 
Applying (14.21) to the last term 

osc?(Mi,ri) -2osc^(M2,r2) < (l + 2ClA20scg)r/^(Ml,7^). (4.11) 
Rearranging terms in (|4.1 II) and applying (|4.10l) 

2, ^ (1 - 2iJ.)ciril ^ (1 + A)Cir);(«,. fi) 
^ (l + 2C.A.oscg) ■ 

Combining like terms obtain 

2/ (l-2/i)c2 2 

^^^"^'^^- l + C,(l + A + 2A20scg) ^^- 
Applying the definitions of n and 9^ obtain the result 

□ 

Due to the use of quasi-orthogonality, the required assumption (|4.7I) is stronger than 

— 'U2IIP + 0SC2 < /U (Im — Milp + osc^) 
the condition in for the symmetric problem, but it is also weaker than 

\u — 'U2IIP + OSC2 < — {\u — MilP + OSC?) 

a 

where « = 1 + A > 1, formally similar to the symmetric estimate. We may impose this 
Stronger condition for ease of analysis, however in practice this says that the increase 
in error-reduction we require of the finer mesh needs only come from the energy-norm 
error, not the oscillation. 

We recall a standard result on the mesh overlap, see JH Lemma 3.7. 

Lemma 4.6 (Overlay of meshes). Let the mesh satisfy condition (1) of Assumption 
Let 7i , 72 € T. Then the overlay T = Ti (B T2 is conforming and satisfies 

#r < #ri + #r2 - #ro. 
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4.3. Quasi-optimality of total errror. We show the total error satisfies Cea's Lemma; 
e.g. see [4J Lemma 5.2. This version appropriate for the non-symmetric problem relies 
quasi-orthogonality 13. 1 I rather than Galerkin orthogonality. 

Theorem 4.7 (Quasi-optimality of total error). Let the problem data satisfy Assump- 
tion \2.1\ and the mesh satisfy Assumption \2.2\ Let TJ G T. Let u the solution of (11.31) 
and Ui G Vi the solution of (12.221) . Then there is a constant Co depending on the initial 
mesh To cind the problem data D such that 

\\u - uif + osc?(mi, Ti) < Cd inf - vf + osc?(t;, Ti)) . (4.12) 
Proof. For e > choose G Vi with 



\\u — 



"2 + osc?(w„ Ti) < (1 + e) inf - vf + osc?(t;, Ti)) 



By (1431) with Ti = 71 obtain 

osc?(i;i, Ti) < 2osc?(t;„ Ti) + 2A20scg|||ui - v^f. (4.13) 
By the same reasoning as (|3.1I) obtain 

III III 2 I III ll|2 ^ A III III 2 

— Mil fell <A||m — fell 

which implies 

- Ml IIP < A||f - felp and ||mi - v^f < A|||f - v^f. (4.14) 
From (14.131) and (14.141) obtain 

||m — flip + OSC^(f 1, 7i) < A||f — fel^ + 20SC^(fe, 7i) + 2A20SCo|||f 1 — felP 

< A (l + 2A2OSC0) Iff - felf + 20SCi(fe, Tl). 

Set Cd ■■= max{2, A (1 + 2A20sc^)} then 

|||f - flip + osCi(f 1, Tl) < Cd (|||m - v^f + osci(fe, Ti)) 

< Cd{1 + e) inf (||f - f IP + osc?(f , Ti)) . 

Letting e — )■ establishes the result. □ 

4.4. Approximation classes and approximation property. For problem with solution, 
forcing function and data (f , /, D) and dual problem (2, g, D*), membership in an ap- 
propriate approximation class says the solution u (respectively z) may be approximated 
within a given tolerance by finite element approximation while the cardinality of the 
mesh required to achieve the tolerance satisfies (14.181) . 

For > let Tat the set of conforming triangulations generated from the initial mesh 
To such that the increase in cardinality is at most 

Tjv := {reT\#r-Wo<N}. 

For s > define the standard approximation classes for solutions based on the energy 
error 



A:=<fGV supfA^Mnf inf |||f - fr||| < 00 ^ (4.15) 

' ' Ar>0 rgT^^^)r6Vr ' 



and for L2 data 



As:={9e sup (AT^ inf \\h{g - lii^.M^n)) <oo\. (4.16) 

N>0 V STat 
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Define a measure of approximation based on the total error 

a{N- V, f, D) := inf inf - vrf + os4(t;^, T)) 

and denote the total error of vr G V7- by 

E{v,r) := (|||i;-t;r|ir + osc^(t;r,r))'/' 
and the approximation class based on the total error for s > 



As := {v, f, D) \v, f, DU := sup(iVV(iV; v, f,D)) < 00 \. (4.17) 

See [Hll Lemma 5.3 and Lemma 5.4 for a discussion on the relation between the classes 
As, As and As- The results in this paper are developed with respect to the class based 
on the total error. 

Membership of the primal and dual solutions in the approximation classes and A^ 
is applied via the use of the two properties discussed in this section. 

Lemma 4.8 (Approximation property). Let the mesh satisfy condition (1) of Assumption 
\2.2\ Let u the solution to (11.31 ). Assume u E As and cr{l;u, f, D) > 0. Then given e > 
there is a global constant C depending only on the initial mesh % and the problem data 
D, a partition % E T and av^E Yj-^ such that 

C\uj;D\s>{Wo-rsre (4.18) 

E{v,,re)<e. (4.19) 

Proof. By (14.171) and property of the supremum, for any > 

\u,f,D\s>N'a{N;u,f,D) (4.20) 

where N = i^T - #7^. Given e > consider all > such that a{N] u, f, D) > s. If 
there is no such A^, let A^. = L By (|4.20l) 

|m, /, -D|s > o"i = —e where ai := a{l; u, /, D). 

Applying the assumption cr(l; m, f,D)>0 

e 

establishing (14.181) with C = e/ai. Also 



u,f,D\s > e 



a{l;u,f,D)=mf inf E{v,T) < e 

T gTi uGVrj 

SO there is 7^ G Ti and E Y-ji so that E(vs, Te) < e establishing (14.191) . Otherwise, 
there is A^ > with a[N\ u, f, D) > e. As the infimum over the total error goes to zero 
as A^ — > 00 this holds for finitely many A^ so define 

K ■=md.^{N >Q\a{N-uJ,D)>e}. (4.21) 

By (14301) and (|43T]) 

|u, f,D\s> K'a{K; u, /, D) > K'e. (4.22) 

Let A^, = 2K. 

\uJ,D\s>K'e = 2-'N:e =^ C\uJ,D\s>N:e 

with C = 2'' establishing (14.181) . By (14.211) and property of the infimum with > K 

aiN,-u,f-D)= mi inf E{v,r) < mf mfE{v,T)<e 
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implying E^v^, %) < e for some 7^ G T,. and a G V7; establishing (14.191) . □ 

4.5. Cardinality of Mk and quasi-optimality of the mesh. The results on the cardi- 
nality oi M.k and quasi-optimality are variations on [4] Lemma 5.10 and Theorem 5.11. 
Here we address the goal-oriented method discussed in l2.3[ 

Lemma 4.9 (Cardinality of M.k)' Let the problem data satisfy Assumption 12. 1 1 and the 
mesh satisfy Assumption \2.2\ Assume conditions (1) and (2) of Assumption \4.4\ Let u 
the solution of (11.31) and z the solution of (12.11) . Let {%,Yk,Uk, Zk}k>o the sequence 
of meshes, finite element spaces and discrete primal and dual solutions produced by 
GOAFEM. If{u, f, D) G and (z, g, D*) G At we have 

#A1.<2c|(l + A)V2^(^l_^^ ^ \uJ,D\l/^C]/''E-'/\uk,rk) 

+ (1 + A)^/^* \z,g,D*\y'cTEk'^\zk,rk) (4.23) 

where Co is the constant from (|4.12l) and the total errors in the primal and dual problems 

Eliuk, Tk) ■■= \\u - Uk\f + osc^(Mfc, Tk) 
Elizk, Tk) := l\z - Zk\f + oscKzk, Tk). 

Proof Set ft = 1(1- (1 + A)-^ with A given by (Il6l) . 

el := p,C;[j^El{uk, Tk), and := jiC]j'El{zk, Tk). 

As (m, /, D) G Kg, by the properties in section |4~4] there is a 7^ G T and a f p G ¥7^ such 
that 

#rp - #ro < /, D\\'^e-^^'' (4.24) 
- Vpf + osc^^(i;p, Tp) < 4. (4.25) 
Similarly for (z, g, D*) G At, there is a 7d G T and a u;^^ G ¥7^ such that 

#r, - #ro < C\z,g,D*\y's,'^' (4.26) 
Ilk - Wdf + osc^^(wrf, Td) < 6% (4.27) 

Let T2 '■= Tk (B (Tp © Td) as in Lemma |46l Let U2 G V2 the Galerkin solution to (12.221) 
and Z2 G V2 the respective solution to (12.231 ) . See there is a reduction in the total error 
by a factor of jl from Uk to U2 (respectively Zk to Z2). Since T2 > Tphy Theorem |4.7[ 
monotonicity of infimum over total error and (14.251 ) 

i^^ - «2||P + 0SC2(U2, T2) < Cd inf (|||u - vf' + OSC2(f,7^)) 

DeV2 
<CDel 

= fi {ju - Ukf + oscliuk, Tk)) . (4.28) 

Similarly for the dual problem 

Ilk - + osc2(-22, T2) <fi {l\z - Zk\f + oscKzk, Tk)) • (4.29) 

This satisfies the hypothesis (14.61) in each problem so applying 14.21 the refining subset 
TZ := TZTk^Tz C Tk satisfies the Dorfier property for 9 < 9^. The marking procedure 
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selects a subset for marking Mk C Tk of minimal cardinality up to a factor of two so 
that by Lemma |4~6l 

#-Mfc < 2#7^ < 2(#r2 - #r,) < 2 {(#rp - #ro) + (#r, - #ro)} . (4.30) 

By (14.301) . (14.241) . the definition of Sp and erd, (14.281) and the definition of /x 

< 2 {(#rp - #ro) + - #ro)} 

< 2C { |«, /, + \z, g, D*\y'e-/^'} 

= 2chl + A)^/- (^1 - y It., /, Diy^Ci/^^E, ^/^K, Tk) 

□ 

Theorem 4.10 (Quasi-optimality). Let the problem data satisfy Assumption 12. 1 1 and the 

mesh satisfy Assumption 12. 21 Let Assumption \4~4\ be satisfied by GOAFEM. Let u the so- 
lution of (|1.3I) anJ 2; solution of (|2.1I) . Le? {7^, Vfc, w^, -2fc}fc>o sequence of meshes, 
finite element spaces and discrete primal and dual solutions produced by GOAFEM. Let 

{u, /, D) G As and {z, g, D*) G A^. Then 

AfJl + ^j Qk^^^iuk^Tk) 

Proof. Let the total error in primal and dual problems Ek{uk, Tk) and Ek{zk, Tk) as in 
Lemma 149] Denote the quasi-error in each problem by 

Ql{uk, Tk) ■= Iju - Mfcif + -fp7]l{uk, Tk), 
Qlizk, Tk) ■■= l\z - Zk\f + idCl{zk, Tk). 
As shown in Theorem 2.4 there is a global constant Cj which satisfies 

fc-i 

#rfc - #ro <CfY, for all A; > 1 

and by (14331) 

< 2C M + A)i/2^ (^1 - - j In, /, D\l/^C];''E,'/'{uk, Tk) 

/ /l2\-l/2t 



then we have 

k-l k-1 

H^Tk - #ro < MpY,Ek{uk,Tk)-"' + Mrf5^Efc(zfc,rfe)-^/* (4.31) 
i=o i=o 
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with the constants 

M, := 2CfC{l + A) ^1 - _ j |«, /, D\]'^C]i'' 

From the domination of the error estimator over the oscillation and the lower bound on 
total error (14.11) we have the equivalence of the total-error and quasi-error 

111^ " ""ilir + Ip^^^^i'^ji 7j) < III'" - ^ilir + IpV'jiuj, Tj) 

< (^l + :^^E\u„rj). (4.32) 

or 

Ej'^'iu,, 7^) < + Qj'^\u„ T,) (4.33) 

and similarly for the dual problem 

E-'^\z„ 7-) < (^1 + Qj'/\z„ 7-). (4.34) 

By the contraction result on the quasi-error (13.371) for < j < A; — 1 

Qliuk, %) < o?'^^^^^Q]{u,, %) and Ql^z^, %) < a^^^-^^Qf (z,-, V"). (4.35) 
Putting together (I4.31L (14.331) and (14.351) obtain 

k-l k-l 

j=0 j=0 
l/2s 



where the geometric series in a; < 1 is bounded by S{9) = q;^/'*(1 — a^^^)^^. Then 



{/ \ l/2s 



<s{e) 



{/ \ l/2s 

Mp M (|||M-M,,f + 7pOSc2(Mfc,rfc))' 



-l/2s 



/ \ l/2t 



As seen in (14.321) the total error and quasi-error are equivalent up to a constant so this 
result may be viewed with respect to either the quasi- or total-error. □ 
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5. Conclusion 

In this article we developed convergence theory for a class of goal-oriented adaptive 
finite element methods for second order nonsymmetric linear elliptic equations. In par- 
ticular, we established contraction and quasi-optimality results for a method of this type 
for the elliptic problem (ll.ll) - (ll.2l) with A Lipschitz, almost-everywhere (a.e.) symmet- 
ric positive definite (SPD), with b divergence-free, and with c > 0. We first described 
the problem class in some detail, with a brief review of conforming finite element dis- 
cretization and error-estimate-driven adaptive finite element methods (AFEM). We then 
described a goal-oriented variation of standard AFEM (GOAFEM). Following the recent 
work of Mommer and Stevenson [10] for symmetric problems, we established contrac- 
tion of GOAFEM. We also showed convergence in the sense of the goal function. Our 
analysis approach was signficantly different from that of Mommer and Stevenson [fTOl , 
and involved the combination of the recent contraction frameworks of Gascon et. al JH, 
Nochetto, Siebert, and Veeser [1 1 J, and of Hoist, Tsogtgerel, and Zhu [8J. We also did a 
careful complexity analysis, and established quasi-optimal cardinality of GOAFEM. 

Problems that were not yet addressed include allowing for jump discontinuities in the 
diffusion cofficient, and allowing for lower-order nonlinear terms. We will address both 
of these aspects in a future work. 

6. Appendix 

Duality. We include an appendix discussion of the duality argument used in the quasi- 
orthogonality estimate in an effort to make the paper more self-contained. 

Let u the variational solution to (11.31 ) and mi G Vi the Galerkin solution to (12.221 ). 
Assume for any g E L2(fi) the solution w to the dual problem (12.11 ) belongs to H'^iVt) fl 
Hl{n) and 

\w\H^m < ^i?||5'||L2(n)- (6.1) 

Then 

11"" ~ ""11^2 ^ C*/;,o|||ii — (6.2) 
IfwE n Hq^D.) but w ^ H'^{il) due to the angles of a nonconvex polyhedral 

domain then w G H^'^'^ for some < s < 1 where s depends on the angles of d^l. 
Assume in this case for any g E L2 

\u!\m+s^n) < Kji\\g\\L^(Q) (6.3) 

then 

||w — mi||l2 < C'^olll'" ~ ""ill- (6-4) 
As discussed in [ISl, H and fl\ the regularity assumptions are reasonable based on the 
continuity of the diffusion coefficients aij and the convection and reaction coefficients 6j 
and c in Loo(f^). 

Proof of (16.21) ." The proof follows the duality arguments in [T| and |l3l. 
Let w E Hq(^1) the solution to the dual problem 

a*{w,v) = {u - ui,v), veHq{Q). (6.5) 

Let X'^ a global interpolator based on refinement TJ. Assume Z'^w is C° and the 
corresponding shape functions have approximation order m. For m = 2 

\\w — X'^w||//i < Cxhri\w\H2 (6.6) 

As discussed in [[Tl the interpolation estimate over reference element T follows from 
the Bramble-Hilbert lemma applied to the bounded linear functional f{u) = {u—I^u, v) 
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where v G H^{T) is arbitrary then set to u — X^u. The Sobolev semi-norms for t = 0, 1 
over elements T E T are bounded via change of variables to the reference element. 
Summing over T E T and combining semi-norms into a norm estimate establishes (|6.6I) . 
By (16.11) we have the bound 

\w\h2 < Kr\\u - ui\\l2- (6.7) 
By the identity a{v, y) = a*{y, v) write the primal form of the variational problems 

a{u,v) = f{v), veH'.iQ) (6.8) 
a{ui,v) = f{v), ve¥i (6.9) 
a{v,w) = {u -ui,v), f e -ffo(fi). (6.10) 

Taking v = u — Ui E Hq in (16.101 ) 

a{u — ui,w) = {u — ui,u — ui) = \\u — uiW'j^^. (6.1 1) 

Combining (16.81) and (16.91) we have the Galerkin orthogonality result 

a{u-ui,v) = 0, veYi. (6.12) 

Then by (|6.1 II) and (16.121) noting the interpolant of the dual solution Z'^w E Vi 

II M — Ml 11^2 = o.i'^ — ui,w) = a{u — ui,w — X^w). (6.13) 

Starting with (16.131) and applying continuity (12.81) . interpolation estimate (16.61) and 
elliptic regularity (16.71) 

ll"" ~ ^l|li2 — ^c^u — ^ii||//i||u^ —X^w'^H'^ 

< Mc\\u — ui WmCxhj-^ \w\h'^ 

< KuMcCxhoWu - uiWhAW - ^ilU2- 



Canceling one factor of ||m — ui||l2 and applying coercivity (12.91) 

||n — ni||i2 ^ — CxK^hQfu — ui\. (6.14) 
ms 

Depending on the regularity of the boundary dVt the solution w may have less regu- 
larity: w E -f^ioc(f7) but w ^ H'^iVt). In particular, we may have w E H^^'^ for some 
s e (0, 1). In that case obtain the more general estimate 



\\w - X''w||//i < Cxhl\w\ 

yielding 



l+s 



\\u - ui\\l2 < —CxKRhQlu - uij. 

The value of s is found by considering all comers of boundary dVl. Writing the interior 
angle at each corner by w = vr/a it holds for a > and arbitrary e > 

uj = 7r/a =^ WE H^+''-' 

and if 71 / {pj + 1) < cu < tt/pj for a set of integers pj characterizing the comers of dil 

\\w - X^w\\h^ < Ch^'lwli+s 

where s = min{pj, 1} and s = 1 in the case of a smooth boundary or a convex polyhedral 
domain. Details may be found in [IJ and [il3il . 
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